INDIGO – INtegrated Data Warehouse of MIcrobial GenOmes with Examples from the Red Sea Extremophiles
Abstract
BACKGROUND Next-generation sequencing technologies have substantially increased the throughput of microbial genome sequencing. A variety of experimental and computational methods are used to functionally annotate newly sequenced microbial genomes, and integrating information from different sources is a powerful way to enhance such annotation. Functional analysis of microbial genomes, which is necessary for downstream experiments, depends crucially on this annotation, but it is hampered by the current lack of suitable information integration and exploration systems for microbial genomes.

RESULTS We developed a data warehouse system (INDIGO) that enables the integration of annotations for exploration and analysis of newly sequenced microbial genomes. INDIGO makes it possible to construct complex queries and combine annotations from multiple sources, from the genomic sequence through to the protein domain, gene ontology and pathway levels. The data warehouse is intended to be populated with information from genomes of pure cultures and uncultured single cells of Red Sea Bacteria and Archaea. Currently, INDIGO contains information from Salinisphaera shabanensis, Haloplasma contractile, and Halorhabdus tiamatea, extremophiles isolated from deep-sea anoxic brine lakes of the Red Sea. We provide examples of using the system to gain new insights into specific aspects of the unique lifestyle of these organisms and their adaptations to extreme environments.

CONCLUSIONS We developed a data warehouse system, INDIGO, which enables comprehensive integration of information from various resources for the annotation, exploration and analysis of microbial genomes. It will be regularly updated and extended with new genomes, and it is intended to serve as a resource dedicated to Red Sea microbes. In addition, through INDIGO, we provide our Automatic Annotation of Microbial Genomes (AAMG) pipeline. The INDIGO web server is freely available at http://www.cbrc.kaust.edu.sa/indigo.
Similar resources
Mining a database of single amplified genomes from Red Sea brine pool extremophiles—improving reliability of gene function prediction using a profile and pattern matching algorithm (PPMA)
Reliable functional annotation of genomic data is the key-step in the discovery of novel enzymes. Intrinsic sequencing data quality problems of single amplified genomes (SAGs) and poor homology of novel extremophile's genomes pose significant challenges for the attribution of functions to the coding sequences identified. The anoxic deep-sea brine pools of the Red Sea are a promising source of n...
A catalogue of 136 microbial draft genomes from Red Sea metagenomes
Earth is expected to continue warming and the Red Sea is a model environment for understanding the effects of global warming on ocean microbiomes due to its unusually high temperature, salinity and solar irradiance. However, most microbial diversity analyses of the Red Sea have been limited to cultured representatives and single marker gene analyses, hence neglecting the substantial uncultured ...
IMG 4 version of the integrated microbial genomes comparative analysis system
The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG's data content and analytical capabilities have increased continuously since its first version released in 2005...
Nitrogen and phosphorous budgets for integrated culture of Litopenaeus vannamei with red sea algae Gracilaria corticata under zero water exchange system
Abstract In this study, a 2×3 factorial design with two levels of shrimp density (25 and 50 shrimp per m-2) and three levels of red algae density (0, 200 and 400g per m-2) was applied to calculate nitrogen and phosphorous budgets in integrated culture of Litopenaeus vannamei with Gracilaria corticata during 45 days under zero water exchange system. Juvenile of L.vannamei (5.82 ± 0.11 g...
The lifestyle of prokaryotic organisms influences the repertoire of promiscuous enzymes.
The metabolism of microbial organisms and its diversity are partly the result of an adaptation process to the characteristics of the environments that they inhabit. In this work, we analyze the influence of lifestyle on the content of promiscuous enzymes in 761 nonredundant bacterial and archaeal genomes. Promiscuous enzymes were defined as those proteins whose catalytic activities are defined ...